Processor subsystem interconnect architecture for a large symmetric multiprocessing system
نویسندگان
چکیده
Integral to the significant capacity growth of the IBM eServer z990 (the eighth-generation zSeries CMOS-based server) from its predecessor z900 system is the interconnect architecture, which tightly couples 48 customer CPUs in the system. A major attribute of this architecture is a new “hot swap” feature which improves zSeries system availability for customers by permitting the substitution or addition of a field-replaceable unit (FRU) in the processor subsystem without requiring the system to be powered down. The novel two-level interconnect architecture contains a distributed switch which connects up to four processor-memory nodes in book packages. The book packages, which are also FRUs, are connected in a dual concentric ring topology at the second-level (L2) interconnect. This architecture also contains an integrated 32-MB L2 cache and central switch connecting up to eight dual-core processor chips in a star topology at the first-level interconnect inside one of these nodes. This paper describes the bus protocol on the second-level interconnect, the cache coherency management throughout the storage hierarchy, and the ring topology reconfiguration for hot swap. Also described is a memory power management scheme to support the power demand from the 48 CPUs and up to 256 GB of memory.
منابع مشابه
A Practical Symmetric Multi-Processor Architecture Design Study using Optical Multi-Drop Networks
This paper presents a design study for a scalable symmetric multi-processor (SMP) server computer design that is implemented using optical components. The design heavily exploits the capability of optics to support high-bandwidth multi-source, multi-drop transmission (optical broadcast), since such a large portion of the intra-SMP traffic can benefit from broadcast signal transmission to assure...
متن کاملHigh Performance I/O Design in the AlphaServer 4100 Symmetric Multiprocessing System
ing system based on the Alpha 21164 64-bit RISC microprocessor. This midrange system supports one to four CPUs, one to four 64-bit-wide peer bridges to the peripheral component interconnect (PCI), and one to four logical memory slots. The goals for the AlphaServer 4100 system were fast CPU performance, low memory latency, and high memory and I/O bandwidth. One measure of success in achieving th...
متن کاملServer I/O Acceleration Using an Embedded Multi-core Architecture
This paper presents a feasibility study on the use of an embedded multi-core system-on-a-chip (SoC) architecture to accelerate server I/O subsystem functions, as an alternative to implementation via finite state machines (FSMs) and hardwired logic. The multi-core solution is significantly more programmable than FSMs and avoids many of their shortcomings. For the purposes of this SoC we use the ...
متن کاملSystem emulation with QEMU
QEMU is an open source emulator for complete PC systems. In addition to emulating a processor, QEMU permits emulation of all necessary subsystems, such as networking and video hardware. It also permits emulation of advanced concepts, such as symmetric multiprocessing systems (up to 255 CPUs) and other processor architectures, such as ARM or PowerPC. This article explores QEMU and its architectu...
متن کاملApplication Performance on the Tri-Lab Linux Capacity Cluster - TLCC
In a recent acquisition by DOE/NNSA several large capacity computing clusters called TLCC have been installed at the DOE labs: SNL, LANL and LLNL. TLCC architecture with ccNUMA, multi-socket, multi-core nodes, and InfiniBand interconnect, is representative of the trend in HPC architectures. This chapter examines application performance on TLCC contrasting them with Red Storm/Cray XT4. TLCC and ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- IBM Journal of Research and Development
دوره 48 شماره
صفحات -
تاریخ انتشار 2004